MDL Multiple Hypothesis Testing

Authors

  • Enes Makalic
  • Daniel Schmidt
Abstract

This paper examines the problem of simultaneously testing many independent multiple hypotheses within the minimum encoding framework. We introduce an efficient coding scheme for nominating the accepted hypotheses in addition to compressing the data given these hypotheses. This formulation reveals an interesting connection between multiple hypothesis testing and mixture modelling, with the class labels corresponding to the accepted hypotheses in each test. An advantage of the resulting method is that it provides a posterior distribution over the space of tested hypotheses, which may be easily integrated into decision-theoretic post-testing analysis.
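The mixture-modelling view mentioned in the abstract can be illustrated with a small sketch. This is not the paper's MDL coding scheme: it is an ordinary two-class Gaussian mixture fitted by EM, assuming each test yields a z-score that is N(0, 1) under the null and N(mu, 1) under the alternative. The function names and the example data are hypothetical, chosen only to show how class-membership probabilities act as a posterior over hypotheses.

```python
import math


def normal_pdf(x, mean, sd=1.0):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def fit_two_class_mixture(z_scores, n_iter=200):
    """EM for a two-class mixture: null N(0, 1) vs alternative N(mu, 1).

    Assumption for illustration only; the paper's method is MDL-based.
    Returns (pi_alt, mu, posteriors), where posteriors[i] is the estimated
    probability that test i belongs to the alternative class.
    """
    pi_alt = 0.5                  # initial guess for the alternative proportion
    mu = max(z_scores, key=abs)   # crude start: the most extreme score
    posteriors = [0.5] * len(z_scores)
    for _ in range(n_iter):
        # E-step: class-membership probability for each test
        posteriors = []
        for z in z_scores:
            alt = pi_alt * normal_pdf(z, mu)
            null = (1.0 - pi_alt) * normal_pdf(z, 0.0)
            denom = alt + null
            posteriors.append(alt / denom if denom > 0.0 else 0.5)
        # M-step: update the mixing proportion and the alternative mean
        total = sum(posteriors)
        pi_alt = min(max(total / len(z_scores), 1e-6), 1.0 - 1e-6)
        mu = sum(p * z for p, z in zip(posteriors, z_scores)) / max(total, 1e-12)
    return pi_alt, mu, posteriors


# Hypothetical z-scores: twelve tests near zero, four clearly shifted.
example_scores = [-1.2, 0.3, 0.8, -0.5, 1.1, -0.1, 0.6, -0.9,
                  0.2, -1.6, 0.4, -0.3, 3.8, 4.4, 3.5, 4.9]
pi_alt, mu, posteriors = fit_two_class_mixture(example_scores)
for z, p in zip(example_scores, posteriors):
    print(f"z = {z:5.1f}  P(alternative | z) = {p:.3f}")
```

Each posterior can then feed a decision rule, for example accepting the alternative when its probability exceeds a threshold chosen from the costs of false positives and false negatives.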

Similar resources

An MDL Approach for Multiple Low Observable Track Initiation

In this paper the track initiation problem is formulated as multiple composite hypothesis testing using maximum likelihood estimation with probabilistic data association (ML-PDA), an algorithm known to work under very low SNR conditions. This algorithm does not have to make a decision as to which measurement is target originated. The hypothesis testing is based on the minimum description length...

P 4: The Hypothesis Detect Multiple Sclerosis in Early Stage with Saliva Testing

Introduction: Recent studies point to the clinical and research efficacy of saliva as a respected diagnostic aid for observing Multiple Sclerosis. The objectives of this Hypothesis are to identify novel biomarkers recognized to Multiple Sclerosis in early stage in saliva and to determine if the levels of these markers correlate with level of these Cerebrospinal fluid and blood assays and urine ...

Hypothesis Selection and Testing by the MDL Principle

The central idea of the MDL (Minimum Description Length) principle is to represent a class of models (hypotheses) by a universal model capable of imitating the behavior of any model in the class. The principle calls for a model class whose representative assigns the largest probability or density to the observed data. Two examples of universal models for parametric classes M are the normalized m...

Minimum description length methods of medium-scale simultaneous inference

Nonparametric statistical methods developed for analyzing data for high numbers of genes, SNPs, or other biological features tend to overfit data with smaller numbers of features such as proteins, metabolites, or, when expression is measured with conventional instruments, genes. For this medium-scale inference problem, the minimum description length (MDL) framework quantifies the amount of info...

Logic Program Induction using MDL and MAP: An Application to Grammars

Probabilistic programs provide an appealing language for describing mental theories, because they are Turing complete: any computable process may be described as a program. Program induction is the problem of inferring theories, in the form of (probabilistic) programs, that describe some set of observations. Minimum Description Length, or MDL, is one common approach to program induction [11]. T...

Publication date: 2011